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ro! . Abstract 

We consider an optimal transportation problem with more than 
two marginals. We use a family of semi-Riemannian metrics derived 
^ . from the mixed, second order partial derivatives of the cost function to 

CN ' provide upper bounds for the dimension of the support of the solution. 

^^ : 1 Introduction 

o, 

O . The optimal transportation problem (with two marginals) asks what is the 

most efficient way to transform one distribution of mass to another relative 
to a given cost function. The problem was originally posed by Monge in 1781 

/\ • [19j. In 1942, Kantorovich proposed a relaxed version of the problem |11] : 



c^ ■ roughly speaking, he allowed a piece of mass to be split between two or more 

target points. Since then, these problems have been studied extensively by 
many authors and have found applications in such diverse fields as geom- 
etry, fluid mechanics, statistics, economics, shape recognition, inequalities, 
meteorology, etc. 
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Here we study a multi-marginal generalization of the above; how do we 
align m distributions of mass with maximal efficiency, again relative to a 
prescribed cost function. Precisely, given Borel probability measures /Xj on 
smooth manifolds M, of respective dimensions rii, for i = l,2...,m and a 
continuous cost function c : Mi x M2 x .... x M^ — )■ K, we would like to 
minimize 



(7(7) = / c{Xi,X2,...,X,n)d'J 

JMiXM2...XMm 

among all measures 7 on Mi x M2... x M^ which project to the //, under the 
canonical projections; that is, for any Borel subset A C Mi, 

7(Mi X M2 X .... X Mi_i X Ax Mi+i.... x M„) = fii{A). 

When m = 2, we recover Kantorovich's formulation of the classical optimal 
transportation problem. 

Under mild conditions, a minimizer 7 will exist. Whereas the two marginal 
problem is relatively well understood, results concerning the structure of 
these optimal measures have thus far been elusive for m > 2. Much of the 
progress to date has been in the special case where the Mj's are all Eu- 
clidian domains of common dimension n and the cost function is given by 
c{xi,X2,...,Xm) = Y^ijLj |a;i-Xjp,orequivalentlyc(xi,X2,...,Xm) = -|(Z]i^*)l' 
When n = 3, partial results for this cost were obtained by Olkin and Rachev 
[20J, Knott and Smith [13] and Riischendorf and Uckelmann [22j, before 
Gangbo and Swi§ch proved that for a general m, under a mild regularity 
condition on the first marginal, there is unique solution to the Kantorovich 
problem and it is concentrated on the graph of a function over xi, hence in- 
ducing a solution to a Monge type problem [9j ; an alternate proof of Gangbo 
and Swi§ch's theorem was subsequently found by Riischendorf and Uckel- 
mann [23]. This result was then extended by Heinich to cost functions of 
the form c{xi,X2, ■■■,Xm) = h(J2i^i) where h is strictly concave [10] and, 
in the case when the domains Mj are all 1-dimensional, by Carlier [3] to 
cost functions satisfying a strict 2-monotonicity condition. More recently, 
Carlier and Nazaret [5] studied the related problem of maximizing the deter- 
minant (or its absolute value) of the matrix whose columns are the elements 
xi,X2, ...,x„ G M"; unlike the results in [9], [10] and [3], the solution in this 
problem may not be concentrated on the graph of a function over one of 
the Xj's and may not be unique. The proofs of many of these results ex- 
ploit a duality theorem, proved in the multi-marginal setting by Kellerer 



Although this theorem holds for general cost functions, it alone says 
little about the structure of the optimal measure; the proofs of each of the 
aforementioned results rely heavily on the special forms of the cost. 

The support of 7, which we will denote by spt{-j), is defined as the smallest 
closed subset of Mi x M2 x ... x Mm of full mass. It is natural to ask 
what this set looks like. When m = 2 and the cost function satisfies a 
twist condition, the solution 7 is unique and is contained in the graph of 
a function from Mi to M2, provided the first marginal is suitably regular; 
this function then solves the original problem posed by Monge [15] [7] p^ [8] p] . 
Assuming Mi and M2 are both C^ smooth manifolds of common dimension 
n, the present author, together with McCann and Warren, has shown that 
under a related non- degeneracy condition on c, spt^j) must be contained in 
an n-dimensional Lipschitz submanifold of Mi x M2 [H]. For a general m, 
there is an immediate lower bound on the Hausdorff dimension of spt['y)] 
as spt{j) projects to spt{^i) for all z, dim(spt(7)) > maxj(dim(spt(/ii))). 
In the present manuscript, we establish an upper bound on dim(spt(7)). 
This bound depends on the cost function; however, it will always be greater 
than the largest of the rij's. In the case when the nj's are equal to some 
common value n, we identify conditions on c that ensure our bound will 
be n and we show by example that when these conditions are violated, the 
solution may be supported on a higher dimensional submanifold and may 
not be unique. In fact, the costs in these examples satisfy naive multi- 
marginal extensions of both the twist and non-degeneracy conditions; given 
the aforementioned results in the two marginal case, we found it surprising 
that higher dimensional solutions can exist for twisted, non-degenerate costs. 
On the other hand, if the support of at least one of the measures /ij has 
Hausdorff dimension n, the remarks above imply that spt{'y) must be at least 
n dimensional; therefore, in cases where our upper bound is n, the support 
is exactly n-dimensional, in which case we show it is actually n-rectifiable. 

A striking development in the theory of optimal transportation over the 
last 15 years has been its interplay with geometry. Recently, the insight 
that intrinsic properties of the solution 7, such as the regularity of Monge 
solutions, should not depend on the coordinates used to represent the spaces 
has been very fruitful. The natural conclusion is that understanding these 
properties is related to tensors, or coordinate independent quantities. The 
relevant tensors encode information about the way that the cost function 
and the manifolds interact. For example, Kim and McCann |13j introduced 
a pseudo-Riemannian form on the product space, derived from the mixed sec- 



ond order partial derivatives of the cost, whose sectional curvature is related 
to the regularity of Monge solutions; they also noted that smooth solutions 
must be timelike for this form. 

Unlike the results of Gangbo and Swi§ch, Heinich and Carlier, our con- 
tribution does not rely on a dual formulation of the Kantorovich problem; 
instead, our method uses an intuitive c-monotonicity condition to establish 
a geometrical framework for the problem. The question about the dimension 
of sptlj) should certainly have a coordinate independent answer. Indeed, 
inspired partially by Kim and McCann, our condition is related to a family 
of semi-Riemannian metricqj; heuristically, spt{j) must be timelike for these 
metrics and so their signatures control its dimension. From this perspec- 
tive, the major difference from the m = 2 case is that with two marginals, 
the metric of Kim and McCann always has signature {n,n). In the multi- 
marginal case, there is an entire convex family of relevant metrics, generated 
by 2"^"^ — 1 extreme points, and their signatures may vary depending on the 
cost. 

Like our work in [T7] and in contrast to the results of Gangbo and Swi§ch 
[Hj, Heinich (TU], and Carlier [3], our results here only concern the local 
structure of the optimizer 7 and cannot be easily used to assert uniqueness 
of 7 or the existence of a solution to an appropriate Monge type problem. 
On the other hand, we do explicitly exhibit fairly innocuous looking cost 
functions which have high dimensional and non-unique solutions and so it is 
apparent that these questions cannot be resolved in the affirmative without 
imposing stronger conditions on c. We address these problems in a separate 
paper [2T] . 

The manuscript is organized as follows: in Section 2, we prove our main 
result. In Section 3 we apply this result to several example cost functions; 
many of these are the costs studied by the authors mentioned above and we 
discuss how they fit into our framework. In Section 4, we discuss conditions 
that ensure the relevant metrics have only n timelike directions, which will 
ensure spt^-j) is at most n-dimensional. In Section 5, we discuss some ap- 
plications of our main result to the two marginal problem and in the final 
section we take a closer look at the case when the marginals all have one 
dimensional support. 



"'^For the purposes of this paper, the term semi-Riemannian metric will refer to a sym- 
metric, covariant 2-tensor (which is not necessarily non-degenerate). The term pseudo- 
Riemannian metric will be reserved for semi-Riemannian metrics which are also non- 
degenerate. 



Acknowledgments: The author is indebted to his advisor Robert Mc- 
Cann, for originally suggesting this problem to him, for many fruitful dis- 
cussions and for his continuous support. He would also like to thank the 
members of his PhD supervisory committee, Jim Colliander and Bob Jer- 
rard, for providing constructive feedback, and Man-Duen Choi and Chandler 
Davis for useful discussions concerning the signatures of symmetric matrices. 

2 Dimension of the support 

Before stating our main result, we must introduce some notation. Suppose 
that c G C^(Mi X M2 X ... X M^). Consider the set P of all partitions of the 
set {1, 2, 3, ..., m} into 2 disjoint, nonempty subsets; note that P has 2™"^ — 1 
elements. For any partition p & P, label the corresponding subsets p+ and p_; 
thus, p+Up_ = {1, 2, 3, ..., m} andp+np_ isempty. For each pG P, define the 
symmetric bi-linear form gp = Yljap+Mv- ^;7l^°r(^^? ^^^fc* +^^r ®dx^j') 
on Ml X M2... X Mm, where, in accordance with the Einstein summation 
convention, summation on the at and a,- is implicit. 



zs 



Definition 2.1. We will say that a subset S of Mi x M2 x ... x Mm 
c-monotone with respect to a partition p if for all y = {yi,y2, ■■■,ym) (ind 
y = {yi, m, •••, ym) m S we have 

c{y) + c{y) <c{z) + c{z), 

where 

I ifiep+, 

^ i ifiep- 

and 

I ifiep-, 

i ifiep+. 

The following lemma, which is well known when m = 2, provides the link 
between c-monotonicity and optimal transportation. 

Lemma 2.2. Suppose 7 is an optimizer and C{'~f) < 00. Then the support 
of '-f is c-monotone with respect to every partition p E P. 



Proof. Define Mp_^ = ®i,^pj^Mi and Mp_ = ®ii^p_Mi. Note that we can 
identify Mi x M2 x ... x Mm witli Mp^ x Mp_ and let 7^.^ and 7p_ be tlie 
projections of 7 onto Mp^ and Mp_ respectively. Consider tfie two marginal 
problem 

inf / c{xi,X2,...,Xm)d\, 



where the infinum is taken over all measures A whose projections onto Mp^ 
and Mp_ are 7p^ and 7p_, respectively. Then 7 is optimal for this prob- 
lem and, as c is continuous, the result follows from c-monotonicity for two 
marginal problems; see for example [21]. D 



We will say a vector v G Ti^xi,x2,...,x„^)^i ^ ^2 x ... x Mm is space- 
like (respectively timelike or lightlike) for a semi-Riemannian metric g if 
g{v,v) > (respectively g{v,v) < or g{v,v) = 0). We will say a subspace 
V ^ 7'{a:'i,x2,.--,i'm)^i ^ ^2 X ... X M^ is spacelike (respectively timelike or 
lightlike) for g if every non-zero t" G y is spacelike (respectively timelike or 
lightlike) for g. We will say V is non-spacelike (respectively non-timelike) for 
(7 if no f G V^ is spacelike (respectively timelike). We will say a submanifold 
of T(^xi,x2,...,xm)Mi ^ ^2 X ••• X Mm is spacclike (respectively timelike, light- 
like, non-spacelike or non-timelike) at (xi,X2, ■■■,Xm) if its tangent space at 
{xi,X2, ■■■,Xm) is spacelike (respectively timelike, lightlike, non-spacelike or 
non-timelike). 

We are now ready to state our main result: 

Theorem 2.3. Let g be a convex combination of the gp's; that is g = 
J2peP^p9p where < tp < 1 for all p & P and Ylip&p^p — ^- Suppose 7 is an 
optimizer and C{-^) < 00; choose a point (xi,a;2, ■■■,Xm) G Mi XM2X ...xM^. 
Let N = J^iLi'iT'i- Suppose the (-1-,— ,0) signature of g at {xi,X2, ■■■,Xm) is 
(g+, g_, N—q+ — q_) (ie, the corresponding matrix has q+ positive eigenvalues, 
g_ negative eigenvalues and a zero eigenvalue with multiplicity N — q^ — q_). 
Then there is a neighbourhood O of {xi,X2, ■■■,Xm) such that the intersec- 
tion of the support of 7 with O is contained in a Lipschitz submanifold of 
dimension N — q^. Wherever the support is smooth, it is non-spacelike for 
9- 



Before we prove Theorem 12. 3[ a few remarks are in order. The roughly 
says that the dimension of spt{'^) is controlled by the signature of any convex 
combinations of the ^fp's; as these metrics may have very different signatures, 



we are free to pick the one with the fewest timehke directions to give us the 
best upper bound on the dimension of sptlj) for a particular cost. When 
m = 2, there is only one partition in P and consequently there is only 
one relevant metric, g^^^^-^^dx'^^ ® dx2^ in local coordinates. The matrix 
corresponding to this metric is the block matrix studied by Kim and McCann: 



G 




D^ c 



D^ c 



92 c 



9a: .-' dx,^ 



Here -D^ .j, c is the rij by n^ matrix whose (a^, ttfc)th entry is 

For m > 2, in the remainder of this paper we will focus primarily on 
the special case when tp = ^^_\_j^ for all p E P. To distinguish it from the 
metrics obtained by other convex combinations of the Qp^s, we will denote the 
corresponding metric by g. Note that the matrix of 'g in local coordinates is 
the block matrix given by 



im— 2 



G 



>m-l 






Dl^ c 


Die . 


. Dl ^ 6 




X\X2 


xixs, 


X\Xm 


Dl ,. c 





Dl ^ c . 


. Dl ^ c 


'^ X-i,X\^ 




X2X3'^ 


^X2Xm. 


Dl^ c 


Dl ,. c 





. Dl ^ c 


X-iXl"^ 


XiX2^ 




-^X^Xm 


Dl . c 


D^ c 


D^ c . 


• • • 1 




^m^l 


XmX2^ 


XmX3^ 





Let us note, however, that other choices of the tp's can give new and useful 
information. For example, suppose we take tp to be 1 for a particular p and 
for all others. As in the proof of Lemma 2.2, we can identify Mi x M2... x 
Mm = Mp^ X Mp_ , where Mp^ = ®j(zp^Mj and c{xi, X2, ■.■,Xm) = c{xp^, Xp_) 
where Xp^ G Mp^. In this case, G will take the form: 



G 







Dl 



'•P+ 







The signature of this g is (r, r, N — 2r) where r is the rank of the matrix 
Dl ^ c. Letting Up^ = J2jep± '^i ^^ ^^^ dimension of Mp^ , we will have 
r < minfrin , , n„ ). If it is possible to choose a partition so that Up^ 



N 



''P+ ; ^p- 



rir. 



and Dl ^ c has full rank, we can conclude that spt{'y) is at most 



N 

dimensional. As we will see later, the number of timelike directions for 'g 
may be very large and so this bound may in fact be better. 

Our proof is an adaptation of our argument with McCann and Warren 
in [T7]. When m = 2, after choosing appropriate coordinates, we rotated the 



coordinate system and showed that c-monotonicity imphed that the solu- 
tion was concentrated on a Lipschitz graph over the diagonal, a trick dating 
back to Minty fl8\. When passing to the mult i- marginal setting, however, 
it is not immediately clear how to choose coordinates that make an anal- 
ogous rotation possible; unlike in the two marginal case, it is not possible 
in general to choose coordinates around a point (xi,X2, ...,Xm) such that 
Dl.^c{xi,X2, ■■■,Xm) = I for all i ^ j. The key to resolving this difficulty 
is the observation that Minty's trick amounts to diagonalizing the pseudo- 
metric of Kim and McCann and that this approach generalizes to m > 3. 

Proof. Choose a point x = (xi,a;2, ■■■,Xm) € Mi x M2 x ... x Mm ■ Choose 
local coordinates around Xj on each Mj and set Aij = Dl,^,c{xi,X2, ...,Xm)- 
For any e > 0, there is a neighbourhood O of (xi, X2, ..., Xm) which is convex 
in these coordinates such that for all (2/1, 2/2, •••, 2/m) G O we have \\Aij — 
DliXjC{yi:y2, ■•■,2/m)|| < e, for all i ^ j. 

Let G be the matrix of g' at x in our chosen coordinates. There exists 
some invertible N hj N matrix U such that 



UGU^ = H :-- 



where the diagonal J, — / and blocks have sizes determined by the signature 
oi g. 

Define new coordinates in O by m := Uy, where y = (?/i, 2/21 •••5 2/m) and 
let u = {ui,U2,U3) be the obvious decomposition. We will show that the 
optimizer is locally contained in a Lipschitz graph in these coordinates. 

Choose y = (?/i, 2/2, •••, ?/m) and y = {yi,y2, ■.■,y~m) in the intersection of 
spti^j) and O. Set Ay = y — y. Set z = {zi, Z2, ■■■Zm) where 



/ 


0" 





-/ 









Similarly, set z 




Zi = 



yi iiiep-, 
yi Hie p+. 



Lemma 12.21 then implies 

ciy) + c{y) < c{z) + c{z) 



or 

• 1 /.I 



J2 i^y^fDl.Ay^s, t))Aykdtds < 0, 

^0 ^0 ^.g 

where 



j&p+,'i-&p- 



This imphes that 

Y^ (At/ifA,A?/i<e Y^ Ay^■^yj. 

jep+,iep- jep+,i<^p- 

Hence, 

J2tp Y i^yifAjAyj<eJ2ip Y ^y^-^yr 

pGP j&P+,i&P- P&P j&P+A&P- 

But this means 

{AyfGAy<eJ2tp Yl ^y^'^yr (1) 

p&P j€p+,i£p- 

With Au = UAy and Au = (A-Ui, A-U2, A-us) being the obvious decomposi- 
tion, this becomes: 

\Aui\^ - \Au2\^ = {AufHAu = {AyfGAy 

<^Y^p Yl ^y^' ^y^ 

p€P jep+,iep~ 

3 

<em'\\U-'\\'y\A 

i 

where the last hne follows because for each i and j we have 

\Ay,\\Ay,\ < \Ay\' 

< \\U-'\\'\Au\' 

= ||f/-i|p^|AM,p. 
1=1 



W^l^ 



Choosing e sufficiently small, we have 



1 ^ 



Rearranging yields 



Together with Kirzbraun's theorem, the above inequality implies that the 
support of 7 is locally contained in a Lipschitz graph of ui over U2 and W3. 

If spt{'y) is differentiable at x, the non-spacelike implication follows from 
taking y = x in ([1]), then noting that we can take e — t- as y — )■ x. 

D 



3 Examples 



In this section we apply Theorem 12.31 to several cost functions. Throughout 
this section, the dimensions of the Mj are all equal to some common n and 
we will restrict our attention to the special semi-Riemannian metric g defined 
in the last section. 

Example 3.1. Suppose Mi = W^ for alii and that c{xi,X2, ■■■,Xm) = ^(Xllii ^« 
where D^h < 0; this is the form of the cost function studied by Gangho and 
Switch J^ and Heinich IW^ (actually Heinich made the slightly weaker as- 
sumption that h is strictly concave). Then, up to a positive, multiplicative 
constant; we have 



G 



D^h D^h 

D'^h D'^h 

D^h D^h 

D^h D^h D^h 







If V is an eigenvector of D^h with eigenvalue X, then 

[v, f , V, v...v,v]'^ 



10 



is an eigenvector of G with eigenvalue (m — 1)A and 

[v,-v,0,0,...,Of 
[v,0,~v,0,...,Of 



[v,0,0,0,...,-vf 

are linearly independent eigenvectors with eigenvalue —A. It follows that 
the signature of 'g is {{m — l)n,n, 0) and the solution is contained in an n 
dimensional submanifold; this is consistent with the results of Gangbo and 
Switch and Heinich, who show that if the first marginal assigns measure zero 
to every set of Hausdorff dimension n — 1, then spt{'~^) is contained in the 
graph of a function over Xi . 

Example 3.2. Suppose c(xi, X2, ..., Xm) = ^(^^i Xi), but now assume D^h > 
0. Then the signature of g is {n,n{m — 1),0). Furthermore, we can show 
that any measure supported on the n{m — 1) dimensional surface 



S — {{Xi, X2, ■■■, Xm)\ / ^ Xi — yj, 



i=l 



where y G M" is any constant, is optimal for its marginals. Indeed, adding a 
function of the form '^^^Ui^Xi) to the cost c shifts the functional C{j) by 
an amount Yl^i Im '^ii^d'^f'i f^'^ each 7 but does not change its minimizers. 
In particular, minimizing the cost c is equivalent to minimizing 



m m 



c'{xi,X2,...,x„i) := c{xi,X2,...,x„i) -2^Xi ■ Dh{y) = /(> ^Xj), 

i=l i=l 

where f{z) := h{z) — z ■ Dh{y). Then f is a strictly convex function whose 
gradient vanishes at z = y; it follows that y is the unique minimum of f. 
Hence, c'{xi,X2, .■■,Xm) < /(?/) with equality only when J2^i^i — V- ^^ 
follows that any measure supported on S is optimal for its marginals. 

Example 3.3. Let c(xi,a;2, ..., Xm) = h(Yl™^^Xi), but now suppose the sig- 
nature of D'^h is {q, n — q,0). Then the signature ofgis (g + (m — 1) (n — 
q),n — q + q{m — 1), 0), and in fact we will find an optimal measures whose 
support has dimension {n — q + q{m — 1)). 

11 



At a fixed point p, we can add an affine function of (xi + X2 + ... + Xm) 
so that Dh{p) = and choose variables so that 



D^h{p) 



I ■ 

-J 



where the top left hand corner block is q by q and the bottom left hand cor- 
ner block is n — q by n — q. Then define the q-dimensional variables yt = 
(xj, x^, ..., x^) and the n — q dimensional variables zi = (x^"*" , x^"*" ,...,x^), 
so that h{J2^i^i) = ^(Xli^i ?/«' Silli -2«)- Now, near p, the implicit func- 
tion theorem implies that for fixed Zi,i = 1,2, ...,m there is a unique K = 

m m 

DyhiKiY,z,),J2z,) = 

i=l i=l 

and K is smooth as a function of ^^^ Zi. As h is convex in it's first slot 
near p, 

mm mm 

Mi^( J] 2,), J] z,) < MJZ y., E ^^) 

for all nearby yt. Now, if we f{YA=i ^i) = KK{Yh=i ^i)y YlT=i ^i) ^hen f is a 
concave function ofY^JLi -Sj- If we consider an optimal transportation problem 
for the Zi with cost f , the solution must be concentrated on a Lipschitz n — k 
dimensional submanifold. Choose an n — q dimensional set S which supports 
an optimizer for this problem; by considering a dual problem as in Gangbo and 
Switch JBI, we can find functions Ui{zi) such that fiYl^i ^i)~Yl^i '^ii^i) ^ 
with equality if and only if {zi, Z2, ■■■Zm) G S . Therefore, 

m rn rn m -m m 

i=l i=l i=l i=l i=l i=l 

and we have equality only when (^i, Z2, ■■■Zm) G -S" and YlT=i Vi ~ ^iYllLi ^t); 
which is a n — q + {m — l)q dimensional set. It follows that this set is the 
support of an optimizer for appropriate marginals. 

Example 3.4. Chiappori, McCann and Nesheim 16] and Carlier and Eke- 
land [41 studied a hedonic pricing model involving a multi-marginal opti- 
mal transportation problem with cost functions of the form c(xi, X2, ..., x^) = 
iriiy^YY.T=ifii^i^y))- Assume: 

12 



1. Y is a C^ smooth n-dimensional manifold. 

2. For all i, fi is C^ and the matrix Dl.yfi of mixed, second order partials 
derivatives is everwhere non-singular 

3. For each (xi, 0:2, ..., Xm) the infinum is attained by a unique y{xi, X2, ■■■,Xm) G 
Y and 

4- Yl'iLiDyyfi{xi,y{xi,X2,...,Xm)) is non-singular. 

Fixing {xi, X2, ■.-, Xm), we can choose coordinates so that Dl,yfi{xi, y{xi, X2, ..., x^)) = 

I for alii. Now, Y17LiDyfi{xi,y{xi,X2, .■.,Xm)) = 0. Set M = YlT=iDlyfi{xi,y{xi,X2, ...,Xra)) 

and note that as M is non-singular by assumption we must have M > 0.. 

The implicit function theorem now implies that y is differentiable with respect 

to each Xj and: 

m 

5Z ^lyM^i^ 2/(^1' ^2, •■•, Xm))D^^y{xi, X2, ..., Xm)+Dl^Jj{xi, y{xi, X2,..., Xm)) = 0. 
j=l 

So D^^y{xi,X2,...,Xm) = -M-^. Now, as c(xi, X2, ..., x^) < Y.'iLi fi{xi,y)) 
with equality when y = y{xi, X2, ..., Xm) we have 

D^^c{Xi,X2,...,Xm) = D^J{Xi,y{Xi,X2,...,Xm))- 

Differentiating with respect to Xj yields 

Dl^x,c{xi,X2, ..., Xm) = D^^yf{xi, y{xi, X2, ..., Xm))D^^y{xi, X2, ..., Xm) = -M~^ 

for all i 7^ j. Hence, the signature of 'g is {{m — l)n,n,0), by the same 
argument as in Example 1. 

Example 3.5. The problem studied by Carlier and Nazaret in /J/ is equiv- 
alent to the case where m = n and the cost function is the —1 times the 
determinant; ie, /or xi, X2, , ..., Xn G R"', c(xi,X2, ...,x„) is —1 times the de- 
terminant of the n by n matrix whose ith column is the vector Xj. When 
n = 3, they exhibit a specific example where the solution has 4 dimensional 
support; specifically, it's support is the set 

S = {(xi,X2,X3) : |xi| = |x2| = IX3I and (xi,X2,X3) forms a direct orthogonal basis for M."^}. 

13 



We show that g has signature (5,4,0) on S. Choose (xi, 2:2, 3:3) in the 
support; after applying a rotation we may assume Xi = (|a;i|, 0, 0), X2 = 
(0, |xi|,0) and X3 = (0,0, |xi|). A straightforward calculation then yields: 
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-10 0-1000 

There are 5 eigenvectors with eigenvalue 1: 

[010100000]^ 

[001000100]^ 

[000001010]^ 

[1000-10000]^ 

[10000000-1]^. 

There are 3 eigenvectors with eigenvalue -1: 

[010-100000]^ 

[001000-100]^ 

[0000010-10]^. 

Finally, there is a single eigenvalue with eignenvector -2: 

[100010001]^ 

Example 3.6. (Non-uniqueness) Our final example demonstrates that when 
the dimension of spt{'~^) is larger than n, the solution may not he unique. Set 
m = 4 and c(x, y, z, vS) = h{x + y + z + w) for h strictly convex. Suppose all 
four marginals /ij are Lebesgue measure on the unit cube /" in M". Let Si 
be the surface y = —w + (1, 1, 1, ..., 1), z = —x + (1, 1, 1, ..., 1) and take 71 
be uniform measure on the intersection of Si with I"- x I^ x I"' x I^. This 
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projects to fii for i = 1, 2, 3 and 4 and by the argument in Example 2, it must 
be optimal. Now, if we take S2 to be the surface y = —x + (1,1,1,...,!), 
z = —w + (1, 1, 1, ..., 1) and 72 be uniform measure on the intersection of S2 
with I"' X I'^ X !"• X /", we obtain a second optimal measure. 

It is worth noting that this cost is twisted: the maps Xi 1— )■ D^jCi^xi, X2, --Xm) 
are injective for all i y^ j where Xk is held fixed for all k ^ i. In the two 
marginal case, the twist condition and mild regularity on the ^i suffices to 
imply the uniqueness of the solution 7 IW^ : this example demonstrates that 
this is no longer true for m > 3. 



4 The Signature of g 

This section is devoted to developing some results about the signature of the 
semi-metric g = J2peP^p9p ^^ some point x = (xi,X2, ...,Xm)- Studying the 
signature at a point reduces to understanding the matrix 



G 



G12 G23 
G21 G23 
G31 G32 



Gmi G 



m2 



G 



m3 



Glm 

G2m 

Gsrn 

. . . , 





Here, for i ^ j,Gij = aijDl_^.c where aij = ^tp and the sum is over 
all partitions p & P that separate i and j; that is, i & p^ and j G p- or 
i & P- and j G p+. One observation about the signature of the matrix G is 
immediate; as G has zero blocks on the diagonal, it is possible to construct 
a lightlike subspace of dimension Umax = maxj{nj}. This in turn implies 
that the number of spacelike directions can be no greater than N — rimax] 
otherwise, it would be possible to construct a spacelike subspace of dimension 
N — Umax + 1; whlch would have to intersect non trivially with the null 
subspace. Therefore, the best possible bound on the dimension of spt{'y) 
that Theorem 12.31 can provide is Umax- This result is not too surprising. 
We have already noted that for suitable marginals, the Hausdorff dimension 
of spt^j) must be at least Umax] the discussion above verifies that this is 
consistent with Theorem 12. 3[ 

Throughout the remainder of this section we will assume the dimensions 
rii of the manifolds Mi are all equal to some common n. Then G is an nm 
by nm matrix; however, we show here that because of its special form, its 
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signature can be computed from lower dimensional data. For example, when 
m = 2 the signature will always be (n, n) and, as we will see, when m = 3 
it is enough to calculate the signature of an appropriate n hj n matrix. We 
will refer to the signature of g as {q+, g_, nm — q+ — q^) 

The first proposition gives an upper and lower bound for the number of 
timelike directions. 

Proposition 4.1. Suppose rank{Gij) = r for some i ^ j ■ Then q+,q- > r. 
In particular, if Gij is invertible for some i ^ j, the support of j is at most 
(m — l)n dimensional. 

Proof. On the subspace T^^Mi x T^ Mj G restricts to 

■ Gij 
G,j 

Note that (f , u) is a null vector if and only if u is in the null space of Gij and 
V is in the nuUspace of Gji. As both of these spaces are n — r dimensional, 
the nullspace of this matrix is 2{n — r) dimensional. 

As has been noted by Kim and McCann |T3], the nonzero eigenvalues of 
this matrix come in pairs of the form A, —A, with corresponding eigenvectors 
{v.,u) and (v,— -u), respectively, where we take A > [13]. Therefore, there 

are -(2n — 2(n — r)) = r positive eigenvalues and as many negative ones. 

We can now construct a r dimensional timelike subspace for (7. If g+ < r, 
then we could construct a non-timelike subspace of dimension mn — q+ > 
nm — r (for example, take the space spanned by all negative and null eigen- 
values of G). These two spaces would have to intersect non trivially as their 
dimensions add to more than nm, which is a contradiction. An analagous 
argument applies to q_. 

D 

Next, we describe the signature in the m = 3 case: 

Lemma 4.2. Suppose m = 3 and that the mixed second order partials are all 
'-^125 Gi3, and G23 non-singular. Set A = (j'i2(G'32)~^G'3i; suppose A + A^ has 
signature {r^,r^,n — rj^ — r^). Then g has signature ((/+ , g_ , 3n — g+ — g_ ) = 
(n + r_ , n + r_(_ , n — r_|_ — r _ ) . 
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Proof. By changing variables in X2 and X3, we may assume G12 = 6*13 = /. 
In these coordinates, G32 = A~^. Consider the subspace 

S = {(0,p, q):pe T,,M2, q G T^.M^,}. 

By Proposition 14.11 we can find an orthonormal basis for this subspace con- 
sisting of n spacehke and n timehke directions. To determine the signature of 
g then, it suffices to consider the restriction off^ to the orthogonal complement 
(relative to g) S-^ of S; any orthonormal basis of S^ can be concatenated 
with a basis for S to form an orthonormal basis for T^^Mi x T.j..^M2 x T^^^M^. 
A simple calculation yields that S-^ = {{v, —A^v, —Av) : v G T^-jMi} and 

g{{v, -A^v, -Av), {v, -A^v, -Av)) = -{A + A^)(v, v), 
which yields the desired result. D 

In particular, if A is negative definite, g has signature {2n, n, 0) and the 
support of any minimizer has dimension at most n. 

A brief remark about Lemma 14.21 is in order. We mentioned in section 2 
that, while there is only one interesting pseudo metric when m = 2, there is an 
entire family of metrics in the m > 3 setting which may give new information 
about the behaviour of spt^j). However, when m = 3, D^.^cis non singular 
for all i 7^ j, and the coefficients Ojj are all non zero, the signature of g is 

determined entirely hj A = Gi2(G32)"^G'3i = -^^-^ Dl^^^c{Dl^^^c)-^ Dl^^^c. 

032 
Choosing a different g simply changes the ajj's, which does not effect the 

signature oi A + A^ . If one of the aj^'s is zero, it is easy to check that the 

signature of g must be (n, n, n); this yields a bound of 2n on the dimension 

of spt'^ which is no better than the bound obtained when all the ai/s are 

non-zero. Thus, when m = 3 the only information about the dimension of 

spt{'^) which can be provided by Theorem 12. 3l is encoded in the bi-linear form 

Dl^ADl.,c)-^Dl.A^) on T^,M, x T,,M,. 

When m > 3, Lemma [4.21 easily yields the following necessary condition 

for the signature of G to be {{m — l)n, n, 0): 

Corollary 4.3. Suppose the signature of g is {{m — l)n,n,0). Then 
for all distinct i,j and k. 
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Proof. Note that the Gjj's must be invertible (and hence D^.^c must be 
invertible and Ojj > 0) ; otherwise, the argument in Proposition l4.ll imphes 
the existence of a non-spacehke subspace of T^^Mi x T^ Mj whose dimension 
is greater than n. The signature of G ensures the existence of a (m — l)n 
dimensional spacehke subspace, however, and so these two spaces would have 
to intersect non-trivially, a contradiction. 



Similarly, if D^^ c(D 



X i^ Xj 



^Dl ^ c was not negative definite, we could 



use Lemma S]2]to construct a non-timelike subspace of T^^ Mi x T^ Mj x Tx^. Mk 
of dimension greater that n; this, in turn, would have to intersect our {m—l)n 
dimensional timelike subspace, which is again a contradiction. D 

The method in the proof above can be extended to give us a method 
to explicitly calculate the signature of g for larger m when a certain set of 
matrices are invertible. 

For I = 2, 3, ..., m, let Gi be the lower right hand corner In by In block of 
G: 



Gi 





Gm-l+2,m~l+l 
Gm-l+3,m-l+l 



G 



m^m—l+l 



Gm-l+l,m-l+2 



Gm~l+3,m-l+2 



G 



m,m—l+2 



Gm-l+l,m-l+3 
Gm-l+2,m-l+3 





G 



m,m— 1+3 



^m—l+l,m 

^m—l+2,m 

^m—l+3,m 

. . . , 





Lemma 4.4. Suppose Gi has signature (g, In — q, 0) Let G\a he the i,jth block 
of the inverse of Gi. and consider the n by n matrix ^iG\jD'^jij^^c. Suppose 
this matrix has signature (r_|., r^,n — r^ — r^ 
is {q + r^,ln — q + r+,n — r^^- — r^). 



Then the signature of G, 



i+i 



For an algorithm to calculate the signature in the general case, start with 
the lower right hand two by two block, which has signature (n,n,0). Use 
Lemma 14. 4[ or equivalently Lemma 14.21 to find the signature of the lower 
right hand three by three block. Then use Lemma 14.41 again to determine 
the signature of the lower right hand four by four block and so on. After 
m — 1 applications of Lemma [4.41 we obtain the signature of g. 



5 Applications to the two marginal problem 

Together with McCann and Warren, we proved in jlT] that any solution to the 
two marginal problem was supported on an n-dimensional Lipschitz subman- 
ifold, provided the marginals both live on smooth n-dimensional manifolds 
and the cost is non-degenerate; that is, Dl^^^c{xi,X2) seen as a map from 
Tx^Mi to T*^M2 is injective. Kim and McCann noted that in this case, the 
signature of 'g is (n,n, 0) [ISj, so Theorem 12.31 immediately implies this re- 
sult. In fact, our analysis here is applicable to a larger class of two marginal 
problems. 

Unfortunately, the topology of many important manifolds prohibits the 
non- degeneracy condition from holding everywhere. Suppose, for example, 
that Ml = M2 = S^, the unit circle. Then periodicity in xi of ■^{xi,X2) 
implies 

/ o a {xi,X2)dxi = 0. 

It follows that for every X2 there is at least one xi such that q^ q^ (3:1,3:2) = 0. 
In [T7] , we noted that under certain conditions the set where non- degeneracy 
fails is at most 2n — 1 dimensional, which yields an immediate upper bound 
on the dimension of spt{'y). We now use Theorem 12.31 to derive a better 
bound. To this end, suppose that we have two n dimensional manifolds and 
the non-degeneracy condition fails at some point {xi, X2)- If r is the rank of 
the map Dl_^^^c{xi, X2), then the signature of g at (xi, X2) is (r, r, 2n — 2r). 
We conclude that locally spt{-y) is at most 2n — r dimensional. A global lower 
bound on r immediately yields an upper bound for the dimension of spt{'y) 
Next we consider a two marginal problem where the dimensions of the 
spaces fail to coincide; suppose the two manifolds Mi and M2 have dimen- 
sions rii and n2 respectively, where n2 < Ui. Again, let r be the rank of 
Dxix2^'^ then g has signature (r, r, ni + ^2 — 2r). If Dl_^^^c has full rank, ie, 
if r = 77-2 then this reduces to {n2, n2,ni — 'n.2) and the solution may have as 
many as ni dimensions (in fact, if the support of the first marginal has Haus- 
dorff dimension rii, then the Hausdorff dimension of spt{'j) must be exactly 
rii). This result has a nice heuristic explanation. To solve the problem, one 
would first solve its dual problem, yielding two potential functions Ui{xi) 
and U2{x2), and the solutions lies in the set where the first order condition 
Du2{x2) = Dx^c{xi,X2) is satisfied. For a fixed X2, this is a level set of the 
function xi i-)- Dx2c{xi,X2), which is generically ni — n2 dimensional. Fixing 
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X2 and moving along this level set corresponds exactly to moving along the 
null directions of g. On the other hand, as X2 varies, xi must vary in such a 
way so that the resulting tangent vectors are timelike. Hence, the solution 
may contain all the lightlike directions of 'g, which correspond to fixing X2 
and varying Xi, plus n2 timelike directions, which correspond to varying X2 
and with it Xi. 

6 The l-dimensional case: coordinate inde- 
pendence and a new proof of Carlier's re- 
sult 

In |3] , Carlier studied a multi-marginal problem where all the measures were 
supported on the real line and proved that under a 2-monotonicity condition 
on the cost, the solution must be one dimensional. To the best of our knowl- 
edge, this is the only result about the multi-marginal problem proved to date 
that deals with a general class of cost functions. The purpose of this sec- 
tion is to expose the relationship between 2-monotonicity and the geometric 
framework developed in this paper. We will find an invariant form of this 
condition and provide a new and simpler proof of Carlier's result. 
We begin with a definition: 

Definition 6.1. We say c : M™ — t- M zs i,j strictly 2-monotone with sign ±1 
and write sgn{c)ij = ±1 if for all x = (xi,X2, ■■■,Xm) G M™ and s,t > we 
have 

±[c{x) + c{x + tCi + SCj)] < ±[c(x + tCi) + c{x + SCj)] 

where (ci, 62, ■■■Gm) is the canonical basis for M™. 

In this notation, Carlier's 2-monotonicity condition is that sgn{c)ij = — 1 
for all i y^ j- This is not invariant under smooth changes of coordinates, 
however; the change of coordinates Xi 1— )■ —Xi takes a cost with sgn{c)ij = —1 
and transforms it to one with sgn{c)ij = 1. However, it is easy to check that 
the following condition is coordinate independent. 

Definition 6.2. We say c is compatible if, for all distinct i,j, k we have 

sgn{c)ijsgn{c)jk _ _^ 
sgn{c)ik 
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It is also easy to check that c is compatible if and only if there exist smooth 
changes of coordinates Xi \-^ i/i = fi{xi) for i = l,2,...,m which transform 
c to a 2-monotone cost. Combined with Carlier's result, this observation 
implies that compatibility is sufficient to ensure that the support of any 
optimizer is 1- dimensional. 

If the cost is C^, the condition ^^ J^ < is sufficient to ensure sgn{c)ij = 

—1: likewise, ,'^f i j'^!i )~^ j'^f < ensures that c is compatible. We can 

' ' dxidxj ^ axf^dxj ' dxidxf; '^ 

think of the condition on the threefold products Dl_^^^c{Dl^^^c)~^ Dl^^^c in 
Lemma 14.21 as a multi-dimensional, coordinate independent version of Car- 
lier's condition. Corollary 14.31 demonstrates that this condition is necessary 
for ^ to have signature ((m — l)n, n, 0) and, when m = 3, Lemma [4.21 shows 
that it is also sufficient. For ttt, > 3, however, it is not sufficient even in one 
dimension. As a counterexample, consider the cost function 

c{xi, X2, X3, X4) = — a;ia;2 — Xix^, — Xix^ — X2X3 — X2Xi — 8x3X4. 

For this cost. 



G 



which has signature (2,2,0) 

Thus, Theorem 12.31 implies neither Carlier's result nor the generalization 
above , at least if we restrict our attention to the special metric g. Below, we 
reconcile this by providing a new proof of Carlier's result, with the slightly 
stronger assumption ^^,^^, < in place of 2-monotonicity. 

The crux of Carlier's argument is the following result: 

Theorem 6.3. Suppose -^-j^ < for all i ^ j ■ Then the projections 
of the support of the optimizer onto the planes spanned by Xi and Xj are 
non- decreasing subsets. 

In view of the preceding remarks, this implies that when the cost has neg- 
ative threefold products ■'^f i ^'^!^ )~^ ^'^j , the support is 1-dimensional. 

^ dxidxj ^ dx^dxj ' dxidxk^ -^^ 

Carlier's proof relies heavily on duality. He shows that he can reduce 
the problem to a series of two marginal problems with costs derived from 
the solution to the dual problem. He then shows that these cost inherit 
monotonicity from c and hence their solutions are concentrated on monotone 
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sets. We provide a simple proof that uses only the c-monotonicity of the sup- 
port. In addition, our proof does not require any compactness assumptions 
on the supports of the measures. However, after establishing this result, it 
is not hard to show that, if the first measure is nonatomic, the support is 
concentrated on the graph of a function over xi. 

Morally, our proof applies the non-spacelike conclusion of Theorem l2.3l to 
a well chosen semi-metric; however, because we don't know a priori that the 
optimizer is smooth we will prove the theorem directly from c-monotonicity. 

Proof. Suppose (xi, ..., x^) and {yi, ..., ym) belong to the support of the op- 
timizer. We want to show (xi — yi){xi — yi) > for all i. If not, we 
may assume without loss of generality that for some 2 < fc < ttt, we have 
(xi — yi){xi — yi) > for alH < A; and {xi — yi){xi — yi) < for i > k. Hence, 
{xj — yj){xi — 2/j) < for all j < k and i > k. By c-monotonicity, we have 

c{Xi, ...,Xm)+c{yi, ...,ym) < c(?/i, ..., ?/fc_i, Xfc, ..., Xm) + c{xi, ..., Xfc-l, ?/fc, •••,2/m)- 

Hence, 



fc-l m ^1 ^1 ,2 



K—l III „]^ „2 

Y^Y^{xi-y,){xj-yj) / / 
i=l j=k -^0 Jo 



(Pc 

{yi{t),y2{t), ..., yk-i{t),yk{s), ..., ym{s))dtd. 



HjJbj^iJjJb -j 



:s 



< 



where yi{t) =yi + t(xi - yi) for i = l,2,..k-l and yj(s) = yj + s(xj - yj) for 
j = k,k + l,...,m. But, as ^^^c(yi(t), ?/2(t), ..., y/c - l(t),2/fc(s), ..., ^^(s)) < 
0, and {xi — yi){xj — yj) < for all i < k and j > k, every term in the sum is 
nonnegative. As (xi — yi){xj — yj) < for j > k, the sum must be positive, 
a contradiction. 

D 
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